104 research outputs found

Fragments and hot spots in drug discovery

Author: Kozakov Dima
Vajda Sandor
Whitty Adrian
Publication venue: 'Impact Journals, LLC'
Publication date: 21/07/2015

R01 GM064700 - NIGMS NIH HHS; Published version

Boston University Institutional Repository (OpenBU)

PubMed Central

Improved Modeling of Peptide-Protein Binding Through Global Docking and Accelerated Molecular Dynamics Simulations

Author: Alekseenko Andrey
Kozakov Dima
Miao Yinglong
Wang Jinan
Publication venue: 'Frontiers Media SA'
Publication date: 12/06/2020

This work is licensed under a Creative Commons Attribution 4.0 International License.

Peptides mediate up to 40% of known protein-protein interactions in higher eukaryotes and play a key role in cellular signaling, protein trafficking, immunology, and oncology. However, it is challenging to predict peptide-protein binding with conventional computational modeling approaches because of slow dynamics and high peptide flexibility. Here, we present a prototype approach that combines global peptide docking using ClusPro PeptiDock with all-atom enhanced simulations using Gaussian accelerated molecular dynamics (GaMD). For three distinct model peptides, the lowest backbone root-mean-square deviations (RMSDs) of the bound conformations obtained from PeptiDock, relative to X-ray structures, were 3.3–4.8 Å, which are medium-quality predictions according to the Critical Assessment of PRediction of Interactions (CAPRI) criteria. GaMD simulations refined the peptide-protein complex structures, significantly reducing the peptide backbone RMSDs to 0.6–2.7 Å and yielding two high-quality (sub-angstrom) models and one medium-quality model. Furthermore, the GaMD simulations identified important low-energy conformational states and revealed the mechanism of peptide binding to the target proteins. Therefore, PeptiDock+GaMD is a promising approach for exploring peptide-protein interactions.
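The abstract above reports model quality as backbone RMSD against X-ray structures. As a rough illustration of that metric only, here is a minimal sketch that assumes the two coordinate sets are already in a common reference frame (for example, after superimposing the receptors); it does no fitting. The function name `rmsd` and the coordinates are made up for the example and are not taken from PeptiDock or GaMD.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two equally sized lists of (x, y, z) points.

    Assumes both structures are already in the same reference frame;
    no superposition is performed here.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))


# Hypothetical backbone atom positions in angstroms (illustrative values only).
reference = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (3.0, 0.0, 0.0)]
model = [(0.0, 1.0, 0.0), (1.5, 1.0, 0.0), (3.0, 1.0, 0.0)]

# Every atom is displaced by exactly 1 angstrom, so the RMSD is 1.0.
print(rmsd(reference, model))
```

A production workflow would normally read coordinates from structure files and select backbone atoms explicitly; that step is omitted here.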

KU ScholarWorks

How proteins bind macrocycles

Author: Beglov Dmitri
Chennamadhavuni Spandan
Kozakov Dima
Porco John A.
Vajda Sandor
Villar Elizabeth A.
Whitty Adrian
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/09/2014

The potential utility of synthetic macrocycles (MCs) as drugs, particularly against low-druggability targets such as protein-protein interactions, has been widely discussed. There is little information, however, to guide the design of MCs for good target protein-binding activity or bioavailability. To address this knowledge gap, we analyze the binding modes of a representative set of MC-protein complexes. The results, combined with consideration of the physicochemical properties of approved macrocyclic drugs, allow us to propose specific guidelines for the design of synthetic MC libraries with structural and physicochemical features likely to favor strong binding to protein targets as well as good bioavailability. We additionally provide evidence that large, natural product-derived MCs can bind targets that are not druggable by conventional, drug-like compounds, supporting the notion that natural product-inspired synthetic MCs can expand the number of proteins that are druggable by synthetic small molecules.

R01 GM094551 - NIGMS NIH HHS; GM064700 - NIGMS NIH HHS; GM094551 - NIGMS NIH HHS; R01 GM064700 - NIGMS NIH HHS; GM094551-01S1 - NIGMS NIH HHS

Boston University Institutional Repository (OpenBU)

PubMed Central

Efficient maintenance and update of nonbonded lists in macromolecular simulations

Author: Bajaj Chandrajit
Beglov Dmitri
Chowdhury Rezaul
Kozakov Dima
Moghadasi Mohammad
Paschalidis Ioannis Ch.
Vajda Sandor
Vakili Pirooz
Publication venue: 'American Chemical Society (ACS)'
Publication date: 01/10/2014

Molecular mechanics and dynamics simulations use distance-based cutoff approximations for faster computation of pairwise van der Waals and electrostatic energy terms. These approximations traditionally use a precalculated and periodically updated list of interacting atom pairs, known as “nonbonded neighborhood lists” or nblists, to reduce the overhead of finding atom pairs that are within the distance cutoff. The size of nblists grows linearly with the number of atoms in the system and superlinearly with the distance cutoff, and as a result, they require a significant amount of memory for large molecular systems. The high space usage leads to poor cache performance, which slows computation for large distance cutoffs. Also, the high cost of updates means that one cannot afford to keep the data structure always synchronized with the configuration of the molecules when efficiency is at stake. We propose a dynamic octree data structure for implicit maintenance of nblists using space linear in the number of atoms but independent of the distance cutoff. The list can be updated very efficiently as the coordinates of atoms change during the simulation. Unlike explicit nblists, a single octree works for all distance cutoffs. In addition, the octree is a cache-friendly data structure, and hence it is less prone to cache-miss slowdowns on modern memory hierarchies than nblists. Octrees use almost 2 orders of magnitude less memory, which is crucial for simulation of large systems, and while they are comparable in performance to nblists when the distance cutoff is small, they outperform nblists for larger systems and large cutoffs. Our tests show that the octree implementation is approximately 1.5 times faster than nblists in practical use-case scenarios.
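To make the idea of one spatial tree serving every cutoff more concrete, here is a minimal sketch of a static point octree with a range query. It is not the authors' dynamic octree: it has no update or rebalancing logic, and the class name `Octree`, its `capacity` parameter, and the random test coordinates are assumptions made for illustration. It also assumes all atoms lie inside the root cube.

```python
import random


class Octree:
    """Toy point octree: each leaf holds at most `capacity` atom indices."""

    def __init__(self, center, half, capacity=8):
        self.center, self.half, self.capacity = center, half, capacity
        self.points = []      # (index, (x, y, z)) pairs stored in a leaf
        self.children = None  # eight sub-cubes once the leaf has been split

    def _child_for(self, p):
        # Bit k of the octant number is set when p is on the upper side in dimension k.
        octant = sum(1 << k for k in range(3) if p[k] >= self.center[k])
        return self.children[octant]

    def insert(self, index, p):
        if self.children is not None:
            self._child_for(p).insert(index, p)
            return
        self.points.append((index, p))
        # The size guard stops endless splitting when many points coincide.
        if len(self.points) > self.capacity and self.half > 1e-6:
            self._split()

    def _split(self):
        h = self.half / 2.0
        self.children = []
        for octant in range(8):
            center = tuple(
                self.center[k] + (h if (octant >> k) & 1 else -h) for k in range(3)
            )
            self.children.append(Octree(center, h, self.capacity))
        pending, self.points = self.points, []
        for index, p in pending:
            self._child_for(p).insert(index, p)

    def within(self, q, cutoff, out):
        """Append to `out` the indices of stored points within `cutoff` of `q`."""
        # Prune cubes that cannot intersect the query sphere (conservative box test).
        if any(abs(q[k] - self.center[k]) > self.half + cutoff for k in range(3)):
            return out
        if self.children is None:
            limit = cutoff * cutoff
            out.extend(
                i for i, p in self.points
                if sum((p[k] - q[k]) ** 2 for k in range(3)) <= limit
            )
        else:
            for child in self.children:
                child.within(q, cutoff, out)
        return out


def brute_force(points, q, cutoff):
    """Reference answer: check every point against the cutoff."""
    limit = cutoff * cutoff
    return [i for i, p in enumerate(points)
            if sum((p[k] - q[k]) ** 2 for k in range(3)) <= limit]


random.seed(0)
atoms = [tuple(random.uniform(0.0, 20.0) for _ in range(3)) for _ in range(200)]
tree = Octree(center=(10.0, 10.0, 10.0), half=10.0)
for i, p in enumerate(atoms):
    tree.insert(i, p)

# The same tree answers queries for different cutoffs without being rebuilt.
for cutoff in (3.0, 6.0):
    found = sorted(tree.within(atoms[0], cutoff, []))
    print(cutoff, len(found), found == brute_force(atoms, atoms[0], cutoff))
```

An explicit pair list, by contrast, is built for one cutoff and must be regenerated when the cutoff changes, which is the memory and update cost the abstract describes.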

Boston University Institutional Repository (OpenBU)

PubMed Central

FigShare

Improved prediction of MHC-peptide binding using protein language models

Author: Boran Hao
Dima Kozakov
Ioannis Ch. Paschalidis
Mikhail Ignatov
Nasser Hashemi
Pirooz Vakili
Sandor Vajda
Publication venue: Frontiers Media S.A.
Publication date: 01/08/2023

Major histocompatibility complex Class I (MHC-I) molecules bind to peptides derived from intracellular antigens and present them on the surface of cells, allowing the immune system (T cells) to detect them. Elucidating the process of this presentation is essential for regulation and potential manipulation of the cellular immune system. Predicting whether a given peptide binds to an MHC molecule is an important step in this process and has motivated the introduction of many computational approaches to address the problem. NetMHCpan, a pan-specific model for predicting binding of peptides to any MHC molecule, is one of the most widely used methods; it addresses this binary classification problem using shallow neural networks. The recent success of Deep Learning (DL) methods, especially pretrained models based on Natural Language Processing (NLP), in various applications, including protein structure determination, motivated us to explore their use in this problem. Specifically, we consider the application of deep learning models pretrained on large datasets of protein sequences to predict MHC Class I-peptide binding. Using the standard performance metrics in this area, and the same training and test sets, we show that our models outperform NetMHCpan4.1, currently considered the state of the art.

Directory of Open Access Journals

Analysis of Binding Site Hot Spots on the Surface of Ras GTPase

Author: Buhrman Greg
Kearney Bradley M.
Kovrigina Elizaveta A.
Kovriguine Evgueni
Kozakov Dima
Mattos Carla
Napoleon Raeanne
O'Connor Casey
Vajda Sandor
Zerbe Brandon
Publication venue: e-Publications@Marquette
Publication date: 01/11/2011

We have recently discovered an allosteric switch in Ras, bringing an additional level of complexity to this GTPase whose mutants are involved in nearly 30% of cancers. Upon activation of the allosteric switch, there is a shift in helix 3/loop 7 associated with a disorder-to-order transition in the active site. Here, we use a combination of multiple solvent crystal structures and computational solvent mapping (FTMap) to determine binding site hot spots in the “off” and “on” allosteric states of the GTP-bound form of H-Ras. Thirteen sites are revealed, expanding possible target sites for ligand binding well beyond the active site. Comparison of FTMaps for the H and K isoforms reveals essentially identical hot spots. Furthermore, using NMR measurements of spin relaxation, we determined that K-Ras exhibits global conformational dynamics very similar to those we previously reported for H-Ras. We thus hypothesize that the global conformational rearrangement serves as a mechanism for allosteric coupling between the effector interface and remote hot spots in all Ras isoforms. At least with respect to the binding sites involving the G domain, H-Ras is an excellent model for K-Ras and probably N-Ras as well. Ras has so far been elusive as a target for drug design. The present work identifies various unexplored hot spots throughout the entire surface of Ras, extending the focus from the disordered active site to well-ordered locations that should be easier to target.

epublications@Marquette

PubMed Central

Docking Server for the Identification of Heparin Binding Sites on Proteins

Author: Beglov Dmitri
Beglova Natalia
Kozakov Dima
Mottarella Scott E.
Nugent Matthew A.
Vajda Sandor
Publication venue: 'American Chemical Society (ACS)'
Publication date: 13/07/2015

Many proteins of widely differing functionality and structure are capable of binding heparin and heparan sulfate. Since crystallizing protein–heparin complexes for structure determination is generally difficult, computational docking can be a useful approach for understanding specific interactions. Previous studies used programs originally developed for docking small molecules to well-defined pockets, rather than for docking polysaccharides to the highly charged shallow crevices that usually bind heparin. We have extended the program PIPER and the automated protein–protein docking server ClusPro to heparin docking. Using a molecular mechanics energy function for scoring and the fast Fourier transform correlation approach, the method generates and evaluates close to a billion poses of a heparin tetrasaccharide probe. The docked structures are clustered using pairwise root-mean-square deviations as the distance measure. It was shown that clustering heparin molecules that are close to each other but have different orientations, and selecting the clusters with the highest protein–ligand contacts, reliably predicts the heparin binding site. In addition, the centers of the five most populated clusters include structures close to the native orientation of the heparin. These structures can provide starting points for further refinement by methods that account for flexibility, such as molecular dynamics. The heparin docking method is available as an advanced option of the ClusPro server at http://cluspro.bu.edu/.
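The clustering step described above, pairwise RMSD as the distance measure followed by ranking clusters by population, can be sketched with a simple greedy scheme. This is an assumption about the general shape of such a procedure, not the ClusPro or PIPER code: the function name `greedy_clusters`, the `radius` parameter, and the one-dimensional toy "poses" standing in for real pairwise RMSD values are all hypothetical.

```python
def greedy_clusters(dist, radius):
    """Greedy clustering on a symmetric distance matrix given as a list of lists.

    Repeatedly takes the remaining pose with the most remaining neighbors
    within `radius` as a cluster center, then removes that whole cluster.
    Returns (center, members) tuples, most populated cluster first.
    """
    remaining = set(range(len(dist)))
    clusters = []
    while remaining:
        def neighbors(i):
            return [j for j in remaining if dist[i][j] <= radius]

        # Iterating in sorted order makes ties resolve to the lowest index.
        center = max(sorted(remaining), key=lambda i: len(neighbors(i)))
        members = sorted(neighbors(center))
        clusters.append((center, members))
        remaining -= set(members)
    return clusters


# Toy stand-in for docked poses: positions on a line, with |a - b| playing
# the role of the pairwise RMSD between two poses (illustrative values only).
poses = [0.0, 0.5, 1.0, 10.0, 10.4, 25.0]
dist = [[abs(a - b) for b in poses] for a in poses]

clusters = greedy_clusters(dist, radius=1.0)
for center, members in clusters:
    print(center, members)
```

Because each cluster is removed before the next center is chosen, cluster sizes never increase down the list, so the first few entries are the most populated clusters that the abstract refers to.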

CiteSeerX

Harvard University - DASH

A First-Generation Multi-Functional Cytokine for Simultaneous Optical Tracking and Tumor Therapy

Author: Dash Rupesh
Elbayly Elizabeth
Figueiredo Jose-Luiz
Fisher Paul B.
Hall David
Hingtgen Shawn
Kasmieh Randa
Kozakov Dima
Nesterenko Irina
Sarkar Devanand
Shah Khalid A.
Vajda Sandor
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2012

Creating new molecules that simultaneously enhance tumor cell killing and permit diagnostic tracking is vital to overcoming the limitations rendering current therapeutic regimens for terminal cancers ineffective. Accordingly, we investigated the efficacy of an innovative new multi-functional targeted anti-cancer molecule, SM7L, using models of the lethal brain tumor Glioblastoma multiforme (GBM). Designed using predictive computer modeling, SM7L incorporates the therapeutic activity of the promising anti-tumor cytokine MDA-7/IL-24, an enhanced secretory domain, and a diagnostic domain for non-invasive tracking. In vitro assays revealed that the diagnostic domain of SM7L produced robust photon emission, while the therapeutic domain showed marked anti-tumor efficacy and significant modulation of p38MAPK and ERK pathways. In vivo, the unique multi-functional nature of SM7L allowed simultaneous real-time monitoring of both SM7L delivery and anti-tumor efficacy. Utilizing engineered stem cells as novel delivery vehicles for SM7L therapy (SC-SM7L), we demonstrate that SC-SM7L significantly improved pharmacokinetics and attenuated progression of established peripheral and intracranial human GBM xenografts. Furthermore, SC-SM7L anti-tumor efficacy was augmented in vitro and in vivo by concurrent activation of caspase-mediated apoptosis induced by adjuvant SC-mediated S-TRAIL delivery. Collectively, these studies define a promising new approach to treating highly aggressive cancers, including GBM, using the optimized therapeutic molecule SM7L.

Harvard University - DASH

Directory of Open Access Journals

PubMed Central

VCU Scholars Compass

FigShare

Protein–protein docking by fast generalized Fourier transforms on 5D rotational manifolds

Author: Kazennov Andrey
Kholodov Yaroslav
Kozakov Dima
Mottarella Scott
Padhorny Dzmitry
Porter Kathryn
Ritchie David
Vajda Sandor
Xia Bing
Zerbe Brandon
Publication venue: 'Proceedings of the National Academy of Sciences'
Publication date: 13/05/2016

Energy evaluation using fast Fourier transforms (FFTs) enables sampling billions of putative complex structures and hence revolutionized rigid protein–protein docking. However, in current methods, efficient acceleration is achieved only in either the translational or the rotational subspace. Developing an efficient and accurate docking method that expands FFT-based sampling to five rotational coordinates is an extensively studied but still unsolved problem. The algorithm presented here retains the accuracy of earlier methods but yields at least 10-fold speedup. The improvement is due to two innovations. First, the search space is treated as the product manifold SO(3)×(SO(3)∖S^1), where SO(3) is the rotation group representing the space of the rotating ligand, and (SO(3)∖S^1) is the space spanned by the two Euler angles that define the orientation of the vector from the center of the fixed receptor toward the center of the ligand. This representation enables the use of efficient FFT methods developed for SO(3). Second, we select the centers of highly populated clusters of docked structures, rather than the lowest energy conformations, as predictions of the complex, and hence there is no need for very high accuracy in energy evaluation. Therefore, it is sufficient to use a limited number of spherical basis functions in the Fourier space, which increases the efficiency of sampling while retaining the accuracy of docking results. A major advantage of the method is that, in contrast to classical approaches, increasing the number of correlation function terms is computationally inexpensive, which enables using complex energy functions for scoring.
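For readers unfamiliar with why FFTs speed up the scoring of many poses, the sketch below shows the classical translational correlation trick on a one-dimensional toy grid. It is deliberately not the five-dimensional rotational method of this paper: it only illustrates that the scores for all shifts of a "ligand" grid against a "receptor" grid come out of one product in Fourier space. The grids, the function names, and the convention that a higher overlap score is better are assumptions made for the example; real docking uses three-dimensional grids and energy-like scores.

```python
import cmath


def fft(values, inverse=False):
    """Recursive radix-2 FFT (unnormalized); len(values) must be a power of two."""
    n = len(values)
    if n == 1:
        return list(values)
    even = fft(values[0::2], inverse)
    odd = fft(values[1::2], inverse)
    sign = 1.0 if inverse else -1.0
    out = [0j] * n
    for k in range(n // 2):
        twiddle = cmath.exp(sign * 2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + twiddle
        out[k + n // 2] = even[k] - twiddle
    return out


def correlate_fft(receptor, ligand):
    """score[t] = sum over x of receptor[(x + t) % n] * ligand[x], for every shift t."""
    n = len(receptor)
    r_hat = fft([complex(v) for v in receptor])
    l_hat = fft([complex(v) for v in ligand])
    # Correlation theorem: multiply by the complex conjugate, then invert.
    product = [r * l.conjugate() for r, l in zip(r_hat, l_hat)]
    return [v.real / n for v in fft(product, inverse=True)]


def correlate_direct(receptor, ligand):
    """Same scores by explicit summation, quadratic in the grid size."""
    n = len(receptor)
    return [sum(receptor[(x + t) % n] * ligand[x] for x in range(n))
            for t in range(n)]


# Toy 1D grids (made-up values): the ligand pattern matches the receptor at shift 3.
receptor = [0.0, 0.0, 0.0, 1.0, 2.0, 1.0, 0.0, 0.0]
ligand = [1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0]

scores = correlate_fft(receptor, ligand)
best_shift = max(range(len(scores)), key=lambda t: scores[t])
print(best_shift)  # 3 for this toy input
```

The direct version is included only as a cross-check; the FFT route gives the same scores for every shift at once, which is the property that makes exhaustive sampling affordable.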

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

PubMed Central

Protein folds vs. protein folding: Differing questions, different challenges

Author: Chen Shi-Jie
Hassan Mubashir
Jernigan Robert L.
Jia Kejue
Kihara Daisuke
Kloczkowski Andrzej
Kotelnikov Sergei
Kozakov Dima
Liang Jie
Liwo Adam
Matysiak Silvina
Meller Jarek
Micheletti Cristian
Mitchell Julie C.
Mondal Sayantan
Nussinov Ruth
Okazaki Kei-ichi
Padhorny Dzmitry
Rose George D.
Skolnick Jeffrey
Sosnick Tobin R.
Stan George
Vakser Ilya
Zou Xiaoqin
Publication venue: 'Proceedings of the National Academy of Sciences'
Publication date: 29/12/2022

KU ScholarWorks